Model Selection

128K Long Context

# 128K Long Context

Devstral Small 2505 Fp8

Devstral is a large language model agent for software engineering tasks developed by Mistral AI in collaboration with All Hands AI, excelling in exploring codebases with tools, editing multiple files, and driving software engineering agents.

Large Language Model

Safetensors Supports Multiple Languages

Devstral Small 2505

Devstral is an intelligent large language model specifically designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels in code exploration, multi-file editing, and driving software engineering agents.

Large Language Model

Safetensors Supports Multiple Languages

Llama 3.1 Nemotron Nano 4B V1.1

Llama-3.1-Nemotron-Nano-4B-v1.1 is a large language model derived from Llama 3.1 8B through compression, optimized for inference efficiency and task execution, suitable for local deployment on a single RTX GPU.

Large Language Model

Transformers English

Devstral Small 2505 Bnb 4bit

Devstral is an intelligent large language model specifically designed for software engineering tasks, developed in collaboration by Mistral AI and All Hands AI. It excels in codebase exploration, multi-file editing, and driving software engineering agents.

Large Language Model

Safetensors Supports Multiple Languages

Medgemma 27b Text It GGUF

MedGemma is a series of medical-specialized AI models optimized based on Gemma 3, with the 27B version focusing on medical text comprehension and reasoning tasks

Large Language Model

Devstral Small 2505 Gguf

Devstral is an intelligent large language model specifically designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels in code exploration, editing, and driving software engineering agents.

Large Language Model Supports Multiple Languages

Typhoon2.1 Gemma3 4b

Thai large language model (instruction-tuned version) with 4 billion parameters, 128K context length, and function calling capability

Large Language Model

Typhoon2.1 Gemma3 12b

Typhoon2.1-Gemma3-12B is a 12-billion-parameter Thai large language model based on the Gemma3 architecture, supporting 128K context length and function calling capabilities.

Large Language Model

Gemma 3 12b It Qat Int4 GGUF

Gemma 3 is Google's lightweight open model series based on Gemini technology. The 12B version employs Quantization-Aware Training (QAT) technology, supports multimodal input, and features a 128K context window.

Phi 4 Mini Instruct.gguf

Phi-4-mini-instruct is a lightweight open-source model focused on high-quality, reasoning-rich data, supporting a context length of 128K tokens.

Large Language Model Other

Gemma 3 27b It Qat GGUF

Gemma 3 is a lightweight open model series built by Google based on Gemini technology, supporting multimodal input and text output, featuring a 128K large context window and support for 140+ languages.

Text-to-Image English

Gemma 3 12b It Qat Int4

Gemma 3 is a lightweight open model series from Google, built on the research and technology used to create Gemini models. The 12B version is an instruction-tuned multimodal model supporting text and image inputs to generate text outputs.

R01 Gemma 3 1b It

Gemma 3 is a lightweight open-source multimodal model introduced by Google, built on the same technology as Gemini, supporting text and image inputs to generate text outputs.

Transformers English

Gemma 3 1b It Qat Q4 0 Unquantized

Gemma 3 is a lightweight open-source multimodal model series developed by Google, built on Gemini technology, supporting text and image inputs with text outputs. The 1B version has undergone instruction tuning and quantization-aware training (QAT), making it suitable for deployment in resource-constrained environments.

Gemma 3 12b It Qat Q4 0 Unquantized

Gemma 3 is Google's lightweight open-source multimodal model series based on Gemini technology, supporting text and image inputs with text outputs. The 12B version undergoes instruction tuning and quantization-aware training (QAT), making it suitable for deployment in resource-limited environments.

Gemma 3 4b It Qat Q4 0 GGUF

Gemma is a family of lightweight, cutting-edge open models introduced by Google, built on the same research and technology as the Gemini models. Supports text and image inputs and generates text outputs.

Gemma 3 27b It Qat Autoawq

Gemma 3 is a lightweight, cutting-edge open model series from Google, built on the same technology as Gemini, supporting multimodal input (text/image) and text output. The 27B version significantly reduces memory requirements through quantization-aware training.

Gemma 3 12b It Qat Autoawq

Gemma 3 is Google's lightweight open model series based on Gemini technology, supporting multimodal input and text output.

Gemma 3 27b It Qat Q4 0 Gguf

Gemma 3 is a lightweight open-source multimodal model series by Google, supporting text and image inputs with text generation capabilities. This version is a 27B parameter instruction-tuned model using quantization-aware training, offering lower memory requirements while maintaining near-original quality.

Gemma 3 12b It Qat Q4 0 Gguf

Gemma 3 is a lightweight open model built by Google based on Gemini technology, supporting text and image inputs to generate text outputs. The 12B version is instruction-tuned and suitable for various generation and comprehension tasks.

Openhands Lm 32b V0.1 AWQ

OpenHands LM is a 32B-parameter open-source programming model, specifically designed for software development agents. It supports local deployment and excels in software engineering tasks.

Large Language Model

Safetensors English

Gemma 3 4b It Llamafile

Gemma 3 is a lightweight open-source model series launched by Google, built on Gemini technology, supporting multimodal input and text output.

Gemma 3 27b It Int4 Gguf

Gemma 3 is a lightweight cutting-edge open model family from Google, built on the same research technology as Gemini models. Supports text/image input and text output, offering both pretrained and instruction-tuned weight versions.

Gemma 3 12b It Int4 Gguf

Gemma 3 is a lightweight multimodal open model from Google that supports text and image inputs with text outputs, featuring a 128K large context window and support for 140+ languages.

Gemma 3 12b It Int4 Awq

Gemma is Google's lightweight cutting-edge open-source model family, built using the same research technology as Gemini models. Gemma 3 is a multimodal model supporting text/image input and text output.

Gemma 3 12b Pt Unsloth Bnb 4bit

Gemma 3 is a lightweight, advanced open model series launched by Google, built on the same research technology as Gemini, supporting multimodal input and text output.

Transformers English

Gemma 3 12b It Gguf

Gemma-3 is a lightweight multimodal open model launched by Google, supporting text and image inputs to generate text outputs. Built on the research and technology behind the Gemini model, it features a 128K large context window and supports over 140 languages.

Gemma 3 12b Pt Qat Q4 0 Gguf

Gemma 3 is a lightweight open-source multimodal model from Google, supporting text and image input with text output, featuring a 128K ultra-long context window and support for 140+ languages.

Gemma 3 4b It Gguf

Gemma 3 is a lightweight open-source multimodal model introduced by Google, supporting image and text inputs to generate text outputs.

Gemma 3 4b Pt Qat Q4 0 Gguf

Gemma 3 is a lightweight open model series launched by Google, built on the same technology as Gemini, supporting multimodal input and text output.

Gemma 3 12b It Qat Q4 0 Gguf

Gemma 3 is Google's lightweight cutting-edge open-source multimodal model supporting image-text input and text output, featuring a 128K context window and 140+ language support.

Gemma 3 1b Pt Qat Q4 0 Gguf

Gemma is a family of lightweight, cutting-edge open models from Google, built on the same research and technology as the Gemini models. The 1B version is a pretrained base model in GGUF format with Quantization-Aware Training (QAT).

Gemma 3 12b It GGUF

Gemma 3 is a lightweight open-source multimodal model series launched by Google, built on the same technology as Gemini, supporting text and image inputs and generating text outputs

Gemma 3 4b It GGUF

Gemma 3 is a lightweight open-source multimodal model from Google, supporting text and image inputs with text outputs, featuring a 128K context window and support for 140+ languages.

Gemma is a lightweight cutting-edge open-source multimodal model series launched by Google, built on the technology used to create Gemini models, supporting text and image inputs to generate text outputs.

Gemma is a lightweight cutting-edge open model series launched by Google, built on the same technology as Gemini, supporting multimodal input and text output.

Gemma is a series of lightweight, cutting-edge open models launched by Google, built on the same research and technology used to create Gemini models.

Phi 4 Mini Instruct Abliterated

Phi-4-mini-instruct is a lightweight open-source model built on synthetic data and curated public websites, focusing on high-quality data with strong reasoning capabilities. It supports a 128K token context length and is enhanced through supervised fine-tuning and direct preference optimization to ensure precise instruction following and safety.

Large Language Model

Transformers Supports Multiple Languages

Gemma is a series of lightweight, advanced open models from Google, built using the same research and technology as the Gemini models.

Gemma is a lightweight, advanced open model series launched by Google, built on the same research and technology as Gemini. Gemma 3 is a multimodal model capable of processing both text and image inputs to generate text outputs.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase